Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Task alerts #721

Merged
merged 18 commits into from Oct 15, 2015
Merged

Task alerts #721

merged 18 commits into from Oct 15, 2015

Conversation

kwm4385
Copy link
Contributor

@kwm4385 kwm4385 commented Oct 6, 2015

Show an alert on the task page if it is a scheduled task that has been running for longer than twice the average for tasks in the request. /cc @tpetr

screen shot 2015-10-06 at 3 59 44 pm

@kwm4385
Copy link
Contributor Author

kwm4385 commented Oct 8, 2015

Alert if a task is killed due to slave decommissioning.

screen shot 2015-10-08 at 11 53 59 am

getAlerts: (requestTaskHistory) =>
task = @models.task
alerts = []
# Is this a scheduled task that has been running much longer than previous ones?
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

instead of calculating this yourself, you should use averageRuntimeMillis in the SingularityDeployStatistics object, which can be grabbed from the deploy history endpoint (.../api/history/request/REQUEST_ID/deploy/DEPLOY_ID). it would also be wise to surface warnIfScheduledJobIsRunningPastNextRunPct from https://github.com/HubSpot/Singularity/blob/master/SingularityService/src/main/java/com/hubspot/singularity/config/SingularityConfiguration.java#L172 into the window.config object so that we don't have to hardcode the threshold value

@kwm4385
Copy link
Contributor Author

kwm4385 commented Oct 14, 2015

Updated

@@ -63,6 +64,8 @@ public String getFinishedTaskLogPath() {
private boolean hideNewDeployButton = false;
private boolean hideNewRequestButton = false;

private Optional<Integer> warnIfScheduledJobIsRunningPastNextRunPct;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i don't think this is necessary -- using the value inside SingularityConfiguration is probably good enough

deployInfo = new DeployDetails
deployId: deployId
requestId: requestId
deployInfo.fetch().success =>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

technically we could only make this call only if it's needed (i.e. if the task is scheduled + currently running)

tpetr pushed a commit that referenced this pull request Oct 15, 2015
@tpetr tpetr merged commit 6d6a23b into master Oct 15, 2015
@tpetr tpetr removed hs_qa labels Oct 15, 2015
@tpetr tpetr deleted the task_alerts branch October 15, 2015 17:54
@tpetr tpetr added this to the 0.4.6 milestone Oct 16, 2015
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants